Evaluation of Resource Description Quality Measures

نویسندگان

  • Mark Baillie
  • Leif Azzopardi
  • Fabio Crestani
چکیده

An open problem for Distributed Information Retrieval is how to represent large document repositories (known as resources) efficiently. To facilitate resource selection, estimated descriptions of each resource are required, especially when faced with non-cooperative distributed environments[1]. Accurate and efficient Resource description estimation is required as this can have an affect on resource selection, and as a consequence retrieval quality. Query-Based Sampling (QBS) has been proposed as a novel solution for resource estimation[2], with proceeding techniques developed therafter[3]. However, the challenge to determine if one QBS technique is better at generating resource description than another is still an unresolved issue. The initial metrics tested and deployed for measuring resource description quality were the Collection Term Frequency ratio (CTF) and Spearman Rank Correlation Coefficient (SRCC)[2]. The former provides an indication of the percentage of terms seen, whilst the later measures the term ranking order, although neither consider the term frequency, which is important for resource selection. We re-examine this problem and consider measuring the quality of a resource description in context to resource selection, where an estimate of the probability of a term given the resource is typically required. We believe a natural measure for comparing the estimated resource against the actual resource is the Kullback-Leibler Divergence (KL) measure. KL addresses the concerns put forward previously, by not over-representing low frequency terms, and also considering term order[2]. In this paper,

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Feasibility Study of Resource Description and Access (RDA) Implementation in Manuscripts’ Bibliographic Records in Iran

This study was conducted to investigate Feasibility of Resource Description and Access (RDA) implementation in manuscripts’ bibliographic records.Paper type: This research is a practical (applicable research)The present research is based on the Research and Development based on documentary and the comparative approach. Findings: The findings prove that out of the identified el...

متن کامل

Resource Based View of the Firm as a Theoretical Lens on the Organisational Consequences of Quality Improvement

Evaluating the investment that healthcare organisations make in quality improvement requires knowledge of impact at multiple levels, including patient care, workforce and other organisational resources. The degree to which these resources help organisations to survive and thrive in the challenging contexts in which healthcare is designed and delivered is unknown. Investigating this question fro...

متن کامل

Empirical evidence for the validity and reliability of resource-use measures based on patient recall: a systematic review

Background Accurate measurement of resource use is required for economic evaluations alongside clinical trials. Despite patient questionnaires commonly being employed, concerns over data quality persist, and there is little certainty about best practices. This review aims to collate the evidence concerning the validity and reliability of resource-use measures based on patient recall and to aid ...

متن کامل

Measuring the effectiveness of human resource information systems in national iranian oil company an empirical assessment

While the growth of MIS investment and its influence is making MIS evaluation ever more indispensable, little attention has been paid to assessing and communicating system effectiveness. This paper attempts to empirically assess the effectiveness of integrated human resource information system in Iranian oil industry. As suggested by recent research, the widely accepted IS success model is...

متن کامل

Intelligent Support for Resource Quality Evaluation and Description in Health Information Portals

Quality control is a critical issue in online health information provision, where domain experts play an important role in evaluating and selecting quality and relevant resources to meet health consumers’ individual knowledge needs. This research aims for a semi-automated approach to tackle challenges domain experts encounter in resource quality evaluation and description in the context of heal...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005